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Abstract 

Let {Xn,n > 1} be a sequence of stationary associated random variables. For 
such a sequence we discuss the limiting behavior of U-statistics based on kernels 
which are of bounded Hardy-Krause variation. 
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1 Introduction 

In this paper we discuss the asymptotic behavior of U-statistics which are based on 
kernels of bounded Hardy-Krause variation. Apropos our discussion, we give the 
following definition. 

Definition 1.1. (Esary et al. (1967)) A finite collection of random variables 
{Xj,l < j < n} is said to be associated, if for any choice of component-wise 
nondecreasing functions h, g : M”’ —)• M, we have, 

Cov{h{Xi,...,Xn),g{Xu...,Xn))>0 

whenever it exists. An infinite collection of random variables {Xj,j > 1} is asso¬ 
ciated if every finite sub-collection is associated. 

Associated random variables have been widely used in reliability studies, statis¬ 
tical mechanics, and percolation theory. A set consisting of independent random 
variables is associated (cf. Esary et al. (1967)). Monotonic functions of associated 
random variables are associated. For details on probabilistic results and examples 
relating to associated sequences, see Bulinski and Shashkin (2007,2009), Prakasa 
Rao (2012), and Oliveira (2012). 

* Corresponding author. 
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Given stationary associated observations {Xj, 1 < j < n}, the U-statistic Un of 
degree k {1 <k <n) based on a symmetric kernel p : —)• M is given by, 

= ( 1 - 1 ) 

^ ^ {n,k) 

where (n, k) indicates all subsets 1 < < j 2 < < jk ^ n of {1,2,..., n}. 

Let F be the distribution function of Xi and Un be the U-statistic based on the 
symmetric kernel p{xi,X 2 )- Let 0 = J p{xi,X 2 ) dF{xi)dF{x 2 )- Define, 

K2 

Pi{xi) = j p{xi,X2) dF{x2), h^^\xi) = Pi{xi) - 6, 

R 

and h^‘^\xi,X2) = p{xi,X2) - Pi{xi) - pi{x2) -|- 9- 

Then, the Hoeffding-decomposition (H-decomposition) for Un is (see, Lee (1990)) 
Un = 0 + 2Hn^ + Hn \ where Hn'^ is the U-statistic of degree j based on the kernel 
j = 1,2. When the observations are i.i.d, E{Un) = 9. 

Next, we discuss the concepts of Hardy-Krause variation and Vitali variation. 
Discussions and applications of these concepts can be found in Clarkson and Adams 
(1933), Adams and Clarkson (1934), Owen (2004) and Beare (2009). The following 
is from Beare (2009). 

Definition 1.2. The Vitali variation of a function f : [a, 6] —)• M, where [a, 6] = 
{x ^ : a < X < b}, a,b G , /c G N is defined as ||/||i/= supY^j^^j^\Ajif\. 

The supremum is taken over all finite collections of k-dimensional rectangles A = 
<i< m} such that = [a,b], and the interiors of any two rectangles 

in A are disjoint. Here, if R = [c,d], a k-dimensional rectangle contained in [a,b], 
then, Afif = X]/c{i 2 fc}(“l)^^^/(®/)j where, xj is the vector in whose i^^ ele¬ 

ment is given by Ci if i G I, or by di if i ^ I, f(ji = f{b). For instance, if k = 2 and 
R = [ci,di] X [c 2 ,d 2 ] then, Auf = f{di,d 2 ) - f{ci,d 2 ) - /(di,C 2 ) -|- /(ci,C 2 ). 

Definition 1.3. The Hardy-Krause variation of a function f : [a,b] —)• M, [a,b] = 
{xeW^ a < x<b}, a,b e R'" , k G N, is given by, \\f\\HK= Y.(Dj^ic{i,...,k}\\fi\\v- 
Here, given a non-empty set I C {1, 2,..., k}, fj denotes the real valued function on 
Y[i(zi[ai, b i] obtained by setting the argument of f equal to bi whenever i ^ F 

When k = 1, the Hardy-Krause variation is equivalent to Vitali variation and 
hence the standard definition of total variation. 

If / : [a, i)] ^ M, [a, 6] = {x G : a < x < b}, a,b ^ A; G N is of bounded 
Hardy-Krause variation, then for any x G (a, 6] there exists a value, denoted by 
f~{x) such that f{xm) —^ f~{w) for any sequence of points {xm} £ [o, x) that 
converges to x. Set f~{x) = f{x) for x 0 (a, 6]. f~ is referred as the left-hand limit 
of /. If fi = fy for all non-empty I C {1, ...,n}, then we say / is left-continuous. 
Similarly, the right-hand limit /+ and the right-continuity of / can be defined. 

The paper is organized as follows. Section 2 includes results and definitions that 
will be required to prove our main results in section 3. In section 3 of the paper. 
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we obtain a central limit theorem for U-statistics based on functions of bounded 
Hardy-Krause variation for stationary associated random variables. In section 4, 
we apply our results to obtain the asymptotic distribution of Gini’s mean difference. 
We give simulation results in section 5 to investigate the asymptotic normality of 
the statistic under the dependent setup. 


2 Preliminaries 


In this section, we give results and definitions which will be needed to prove our 
main results given in section 3. 


Lemma 2.1. (Newman (1980)) Let {Xn,n >1} he a strictly stationary sequence 
of associated random variables. Let = Var(Xi) + 2 ^^2 Cov{Xi, Xj) with 
0 < (T^ < 00 . Then, 


y/na 


'^{Xj - E{Xj)) 4 iV(0,1) asn ^ 00 . 


i=i 


( 2 . 1 ) 


Definition 2.2. (Newman (1984)) / and fi be two complex-valued functions 

on M”, n G N. We say / /i if fi — i2e(e®“/) is coordinate-wise nondecreasing 
for every a G R. If f and fi are two real-valued functions on R”, then f ^ fi iff 
fi + f and fi — f are both coordinate-wise nondecreasing. If f fi, then fi will 
be coordinate-wise nondecreasing. 


Lemma 2.3. (Newman (1984) Let {Xn,n >1} be a stationary sequence of as¬ 
sociated random variables. For each j, let Yj = f{Xj) and Yj = f{Xj). Sup¬ 
pose that f f. Define = Var(Yi) + 2 ^^2 C'ou(Yi, Y^). Let cr^ > 0 and 
0 < Y)'^iCov{Yi,Yj) < 00 . Then, 


y/na ^ 

1=1 


Y,iyj - E{Yj)) ^ NiO,l) asn 


00 , 


( 2 . 2 ) 


Lemma 2.4. (Lebowitz (1972)) Let the random variables {Xj, 1 < j < n} be asso¬ 
ciated. Define, for A and B subsets of {1,2, ...,n} and real xj^s. 


HA,B{xj,j G AUB) = P[Xj > Xj-,j G AuB]-P[Xk > Xk,k £ A]P[Xi > xi,l £ B]. 


Then, 

0 < Ha,b < 

ieA jeB 

Lemma 2.5. (Demichev (2014)) Suppose X andY are associated random variables 
with bounded continuous densities and {X,Y) £ L 2 . Then, for any 0 < 5 < 1/2 
there exists C = C{6) such that, 

sup\P{X <x,Y < y)-P{X < x)P{Y < y)\< C[Cov{X,Y)]^. (2.3) 
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Lemma 2.6. (Beare (2009)) Let Z he a random vector taking values in a bounded 
u + v-dimensional rectangle R = [a, 6] C u, u G N. R is chosen such that each 

Zi is equal to Oi with probability zero. For a non-empty set K C {1, ...,u + v}, let 
Rk = and let Fk denote the joint distribution of those Z^ for which 

k & K. Let Fdj = 1. X = (Zi,Z^) and Y = {Zu+i, ■■■, Zu+v) o,nd the two real 
functions f and g are defined on R{i^,„^u} Suppose f and g are 

of bounded Hardy-Krause variation < oo) and left-continuous. If 

7 < oo is such that, \\Fjuj — FiFj\\oo< 7, for all non-empty sets I C {1,2,...,it} 
and J C {it + 1, It + 2,..., it + i;}, then we have, 

\Cov{f{X),g{Y))\<j\\f\\HK\\9\\HK. 


Lemma 2.7. (Gary and Dewan (2015)) Let {X„,n > 1} 6e a stationary sequence 
of associated random variables. For each), letYj = f{Xj) andYj = f{Xj) such that 
f f. Let E{Yi) = fi and 0 < E{Y^) < oo. Suppose {in,n' > 1} is a sequence of 
positive integers with I < in ^ n and in = o(n) as n ^ oo. Set Sj{k) = 

^ Yl]=i Yi- Define, {write i = in), 


B 


n 


1 

n — i-\- 1 


^ \Sj{i)-iYn\\ 

h ^ )' 


(2.4) 


Assume, C'oi'(yi, < oo. Then, 


B 


n 



in L 2 as n ^ 00 , 


where, Uj = Var{Yi) + 2 ^^2 C'oi;(li, Y)). 

Lemma 2.8. (Birkel (1988a)) Let {Xn,n >1} be a stationary sequence of associ¬ 
ated random variables with E{Xj) = 0 and \Xj\ < Ci < 00 for j > 1. Assume that 
u{n) = 2 C'ou(Xi, Xj) = 0{n~^'^~‘^^Dfi Then, there is a constant B > 0 

not depending on n such that for all n> 1, 


supE\Sn+m - 5mr< (2.5) 

m>0 

where, Sn = Yl]=i ■ 


3 Limiting behavior of U-statistics based on 
kernels of bounded Hardy-Krause variation. 

The main result of this section is Theorem 3.3. It gives the central limit theorem 
for U-statistics based on a kernel of degree 2 which is of bounded Hardy-Krause 
variation given a sequence of stationary associated random variables. The extension 
of this theorem to U-statistics with kernels of a general finite degree k > 3 is also 
discussed. We also discuss a strong law of large numbers for U-statistics based on 
such kernels using the results in Christofides (2004). 
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3.1 Central limit theorem 

Lemma 3.1. Let {X„,n >1} he a sequence of stationary associated random vari¬ 
ables with |X„|< Cl < oo, for all n > 1. Assume that the density function of Xi, 
denoted by f, is bounded. If h^‘^\x,y) is a degenerate symmetric kernel of degree 
2 which is of bounded Hardy-Krause variation and left continuous, then, under the 
condition Cov{Xi, Xj)'^ < oo, for some 0 < 7 < 1/6, 

^ \E{h^^\Xi,X,)h^^\Xu,Xi))\=o{n^). (3.1) 

l<i<j<n l<k<l<n 


Proof. Let C be a generic positive constant in the sequel. Let {Xl,i > 1} be a 
sequence of random variables independent of {Xi,i > 1} such that \X[,i > 1} are 
i.i.d with / as the marginal density function of X'^. Observe that from definition 
h^'^\x,y) is a degenerate kernel, i.e. J^h^'^\x,y)dF{x) = 0 for all y S M. Hence, 

E{h^‘^\X',Xj)h^^\Xk,Xi)) = 0. (3.2) 


Let K = {z, j, k, 1}, and for all 0 / ^4 C iiT, dehne I = ACi {z} and J = An {j, k, 1} 
{A = I U J). Let the joint distribution function of {Xa,a G S} be denoted by Es, 
for any SEA (F 0 = 1). Define Fa = Fa — FjFj. For any x = {xi,Xj,Xk,xi) G 
[—define z{x) = h1^\xi,Xj)h'^‘^\xk,xi), and za{'x.) = z{xa) where, is 
obtained by setting xt in x equal to Ci whenever t ^ A. za is then a real valued 
function on [—Ci, Observe that za is also a function of bounded Hardy- 

Krause variation and left-continuous. Let /z^^ be the signed measure generated 
by ZA- (A discussion on the construction of measures from functions of bounded 
Hardy-Krause variation can be found in Beare (2009) and the references therein.) 
Replicating the arguments of Theorem 3.1 of Beare (2009), for all 1 < z < j < n 
and 1 < k < I < n, we get. 


E{z{X,,Xj,Xk,Xi)) = E{z{X,,Xj,Xk,Xi)) - E{z{XiXj,Xk,Xi)) 




< E ll%uj|loc(|/i.,,juzl[-C,C]l«^^l) 


(3.3) 

(3.4) 


where, Ihznnjjl denotes the total variation of the measure hz^iyuj- Following the 
ideas illustrated in Theorem 4.1 of Beare (2009), (|m^aI[“C;= II-zaHv) 
where, 1 I^aI |y denotes the Vitali variation of Since a function that is of bounded 
Hardy-Krause variation is of bounded Vitali variation, 


\E{h^^\x,,X,)h^^\Xk,Xi))\<C E ll%uj||oo 

< C{Cov{Xi,Xj)^ + Cov{X„Xk)^ + Cov{X„XiY), 

(3.5) 
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for 0 < (5 = 37 < 1/2. The last inequality in (3.5) follows from Lemmas 2.4 and 
2.5. Similarly, 

< C{Cov{Xj,Xif + Cov{Xj,Xkf + Cov{Xj,Xif), (3.6) 

\E{h^^\Xi,Xj)h^^\Xk,Xi))\ 

< C{Cov{Xk,Xj)^ + Cov{Xk,XiY + Cov{Xk,Xi)^),ior 0 < 6 = 3-f < 1/2. (3.7) 


Combining (3.5), (3.6) and (3.7), 

\E{h^^Hx„Xj)h^^\Xk,Xi))\< CT^/\ (3.8) 

where, T = [Cov{Xi,XjY + Cov{Xi,XkY + Cov{Xi,XiY] x [C'ou(Xj,X*)^ + 
CoviXj,Xk)^ + Cou(X,-,X;)^] X [Cou(Xfc,Xj)'5 + Cov{Xk,X,)^ + Cou(Xfe,Xz)'^]. 
Observe that the right-hand side of the inequality in (3.8) has 27 terms, each term 
being a product of 3 covariance terms. Let r(s) = C'ou(Xi,Xi+s)'*', s > 0. From 
(3.8), stationarity and observing that r(0) < 00 {r{j) < r(0), for all j > 1), 


\E{h^^Hx,,Xj)h(^\Xk,Xi))\ 


< c 


r{\i - j\f X [r{\k - i|) r{\k - j|) -b r{\k - /|))] + r(|i - j\)r{\j - k\) 

+ r{\i - j\)r{\j -l\)+ r{\i - k\)r{\j - i|) -b r{\i - k\)r{\j - k\) + r{\i - k\)r{\j - l\) 
+ ri\i - l\)r{\j - z|) -b r{\i - l\)r{\j - A;|) -b r{\i - l\)r{\j - /|) . 


< C 


r{\i - j\)r{\k - i|) -b r{\i - j\)r{\k - j\) + r{\i - j\)r{\k - l\) 

+ r{\i - j\)r{\i - l\) + r{\i - k\)r{\j - k\) + r{\i - l\)r{\j - k\) + r{\j - l)r{\j - z|) 
+ r{\k-i\)r{\j-l\)+r{\i-l\)r{\j-l\)] = A{i,j,k,l), (say). (3.9) 


Each term in (3.9) is a product of 2 distinct covariance terms. It is easy to show 
that under 

E E A(z, j, k, 1) = o(n^) as n —>■ 00 . (3.10) 

l< 2 <_ 7 <n l<-k<l<n 

Using (3.10), (3.1) follows. □ 


Remark 3.1. The assumption of the function being of bounded Hardy-Krause vari¬ 
ation was used in writing the integral in (3.3). Other types of funetions eould also 
be considered, provided they generate appropriate measures with bounded total vari¬ 
ation. 


Remark 3.2. Using Lemmas 2.4 and 2.5, Lemma 2.6 leads to, 
|E(M2)(Xi,X,-)h(2)(Xfc,X0)| 

< C[Cov{X„Xk)^ + Cov{Xj,XkY + Cov{Xi,XiY + Cov{Xj,Xi)% 
for some 0 < (5 < 1/2. Under C'ou(Xi, Xj)*^ < 00 , 

E E \E{h^‘^\Xi,Xj)h^^^Xk,Xi))\= 0{n^), asn^QO. 

Hence, Lemma 2.6 is not enough to ensure (3.1). 
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Lemma 3.2. Let the conditions of Lemma 3.1 be true. Define Un as the U-statistic 
based on a symmetric kernel p{x, y) which is of bounded Hardy-Krause variation and 
left continuous. Letaf = Var{pi{Xi)) < oo. Define, afj = Cov{pi{Xi), pi{Xi^j)). 
Assume ^ij < Then, 

4^1 ^ 

VariUn) =—^ + oi—), where ah = a'l + 2^^ ah. (3-11) 

n n 


Proof. Let C be a generic positive constant in the sequel. Using H-decomposition, 
Var{Un) = AVariH^^I) + Var{H^^'^) + 

Since ^ Yhj=i and YjT=i^ij < oO; we get, 


1 °° 1 

Var{Hlh^) = -{af + 2j2^ij) + o{-)- 

n n 


(3.12) 


1=1 


Now, 




-2 


E{h(^\x,,Xfih^^\x,,Xi)Y 


1 <*< 1 <^ l<k<l<n 


As p is of bounded Hardy-Krause variation, so is h^‘^^ is degenerate by defini¬ 
tion). Using Lemma 3.1, 

^ ^ u{/i(2)(Xi,A,)/i(2)(Xfc,Az)} =o(n3). (3.13) 

l<i<j<n l<k<l<n 

Therefore, 

Uar(H^))<U(i/^))2 = o(i). (3.14) 

From (3.12) and (3.14), and using Cauchy-Schwarz Inequality we have, 

(3.15) 

From (3.12), (3.14) and (3.15), we have, 

Var{Un) = + o{-). (3.16) 

n n 

□ 


The following gives the central limit theorem for a U-statistic based on a 
stationary sequence of associated observations with a kernel of bounded Hardy- 
Krause variation. 

Theorem 3.3. Assume the conditions of Lemma 3.2 hold and afj > 0. Suppose 
there exists a function pi{-) such that pi <C pi and, 

OO 

'Yh^Cov{pi{Xi),pi{Xj)) < oo. (3.17) 

1=1 

Then, 

- 1 A1(0,1) as n ^ oo, where a^ is defined by (3.11). (3.18) 

2au 
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Proof. Using H-decomposition for Un, 


ni(Un-e) ^ „_i/2 y hf'l(Xj) ^ 

cru cru 

J=l 


2o'u 


In addition, 


from (3.14). Hence, 


nE{H^^)'^ —> 0 as n —)• oo. 

1/2P n 

n ' -)■ 0 as n —)• oo. 

(^u 


From (3.20) and Lemma 2.3, we get that. 


n 


- 1/2 y MLil £, iv( 0 , 1 ) as « ^ oo. 

is 


Relations (3.19), (3.20), and (3.21) prove the theorem. 


(3.19) 


(3.20) 

(3.21) 

□ 


Remark 3.3. The above results can be easily extended to a U-statistic based on 
a kernel of any finite degree k. Let Un be the U-statistic based on a symmetric 
kernel p{xi,X 2 , ■ ■ ■ ,Xk) which of finite degree k and of bounded Hardy-Krause vari¬ 
ation. Suppose af = Var{pi[Xi)) < oo, ^ Further, let 

Y^'^=iF^ov{Xi,Xj)'^ < oo, for some 0 < 7 < 1/6. Then, 

Var{Un) = ^^ + o{-). (3.22) 

n n 

If the conditions of Theorem 3.3 hold, then 

^ ^ A- iV(0,1) as n ^ 00 , (3.23) 

kau 

where afj is defined by (3.11). 


3.2 Strong law of large numbers 

Christofides (2004) showed that {Sn = (2)Un,n > k} {Un dehned by (1.1)), is a 
demimartingale when E{p) = 0 and p is component-wise nondecreasing. Using the 
concept of demimartingales, he proved a strong law of large numbers for Un under 
restrictions on moments of p. He also extended the result to U-statistics based on 
kernels p : [a, 6] —)> M where [a, b] = [ai, bfi x ... x [ok, bk] is a fc-dimensional rectangle 
and p = h — g where, h, g : [a, 6 ] —)• M are two component-wise nondecreasing 
functions and A^h > 0 and Ang > 0 (given in Definition 1.2), V R = [ci, di] x ... x 
[ck,dk] and a* < c* < dj < V i = 1,2,..., k. 

We observe that kernels which are of bounded Hardy-Krause variation fall 
into the class of kernels discussed by Christofides (2004). Hence, under restrictions 
on the moments of the kernel, as discussed in Theorem 2.1 and Lemma 2.2 of 
Christofides (2004) a strong law of numbers is true for U-statistics based on kernels 
of bounded Hardy-Krause variation. 









4 Applications 


4.1 Gini’s mean difference 

Suppose we want a measure of variability for observations from a distribution F. 
A possible index of variability is Mean difference, 9, given by, 

6= f \x — y\dF{x)dF{y). (4.1) 

R2 


Given a sample {Xj, 1 < j < n} from F, an estimator for 6 is the Gini’s mean 
difference, Un, defined by. 


Un = 


_f_ y 

n{n — 1) ^ 


p{Xi,Xj), where the kernel p{x,y) = \x — y\. (4.2) 


When the observations are i.i.d, using Hoeffding (1948) it can be shown that 

yy A- A^(0,1) as n —oo, where ^ = J\x — y\\x — z\dF{x)dF{y)dF{z). 

R3 

We now obtain the limiting distribution of Un when the observations are sta¬ 
tionary and associated. 


Theorem 4.1. Let {Xn,n > 1} be a sequence of stationary associated random 
variables having one dimensional marginal distribution function F and a bounded 
density function. Assume |X„|< Ci < oo, for all n> 1. Let, 

OO 

y] Cov{Xi,Xjy < oo, /or some 0 < 7 < 1/6. (4-3) 

i=i 


Then, 

V^{Un-9) C 

->■ iV(0,1) as n ^ oo, 

2au 

where, Un and afj are defined by (4.2) and (3.11) respectively. 

Proof. By H-decomposition, Un = 0 + 2Hn^ F Hn \ where 9 is given by (4.1). We 
observe p{x,y) is a function of bounded Hardy-Krause variation (Dehnition 1.3). 
Now, pi{x) = Xy— y\dF{y). We can choose pi{x) = Cx, for some (7 > 0 as 
pi(-) is Lipschitzian. Using (4.3) and Theorem 3.3, we have 

V^{Un-9) C 

--->■ A^(0,1) as n ^ oo. 

2cru 

Here, Var{piiXi)) =^-9^ and Cov{piiXi), pi{Xj)) = E[\Xi - X\\Xj-Y\]-9^, 
where X and Y are independent of {X„,n > 1} and are i.i.d such that F is the 
marginal distribution function of X. □ 


Remark 4.1. The above result ean be extended to random variables which are not 
uniformly bounded using the usual truneation techniques. 

The next result is needed in simulation analysis. For details, see comment (4) 
in section 5. 
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Theorem 4.2. Let {X„,n > 1} be a sequence of stationary associated random 
variables having one dimensional marginal distribution function F. Let |X„|< Ci < 
oo, for all n > 1 . Assume, 


Cov{Xi,Xj) = 0{n for some A: > 2, /c e N. (4.4) 

j=n+l 


Then, sup 

xe[-Ci,Ci] 




0 a.s as n ^ oo, where. 


bn = P), for some u > 1 and p € ( 0 , 1 ) such that |(1 + u) — {k + l)p > 1 . 


Proof. Divide the interval [—Ci,Ci\ into dn = 0{nP), for p S (0,1) small intervals 
as follows: Let —Ci = < ... < = Ci. The dn intervals are denoted 

as Im = [yni_i,yni],i = 1,2, ...,dn, each of length 6n = Let e 


sup 

xe[-Ci,Ci] 




< max sup 

* xein. 


+ max sup 

* XGln, 


For X £ Lm, 


El=ii\Xj - x\-\Xj - Xml 


= max sup 

* X&In, 


max sup 

* X&ln, 


Yf;^^{\X^-x\-E{\X^-x\)) 


EU(\^j-^nA-E{\X,-Xn^)) 


EUiEilX, - x\) - E{\X, - Xn,\)) 


= Ji +/2 + Ls (say). (4.5) 




< Hence, h<^. Similarly, I 3 < 


n5n 


nSn 


L 2 = max sup 

® xeiui 

For any e > 0, 






11 HJjJb 

i 



\EU(\^J-^-,\-Ei\Xj-Xn,\)) 
P max — - -:- 


L i 


> e 


< 


i 

Sn{Xni) 


EU(\^^-^n^-E{\Xj-Xn,\)) 


> e 


> e 


< d„max 


E\Sn{Xn,)\^ 

b^e^ 


(4.6) 


where Snixm) = EJ=i(lE' “ Xni\—E\Xj — Xml). Let H be a generic positive 
constant in the sequel. 

E\Sn{Xni)\’" 1 E\Sn{Xni) - Sn{Xni) + Sn{Xni) + Sn{Xni)\^ 


b^e^ 2^ 


b^e^ 


(4.7) 


where, Snixnf) = EJ=i E{Xj — x^)- Observe, \Xj — x|<C B{Xj — x), j > 1, for all 
X G [—Cl, Cl]. For all x G [—Ci,Ci], 


Cov{\Xi - x\-E\Xi - x\-B{Xi - x), \Xj - x\-E\Xj - x\-B{Xj - x)) 

j=n+l 

OO 00 

<bY Cov{Xi-x,Xj-x) = B Y Cov{Xi,Xj) = 

j=n-\-l j=n-\-l 
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Similarly, Cov{\Xi — x\—E\Xi — x\+B{Xi — x), \Xj — x\—E\Xj — x\+B{Xj — 

x)) = for all x G [—Ci^Ci], 

Using the equality (a+ 6)^ = Yl^=o and the Cauchy-Shwarz inequal¬ 

ity, we have, E\Sn{xni) - Snixm) + Snixm) + Snixm)]^ < Y!j=o (j)(-®|5'n(a;ni) - 
{SniXn:) - E{Sn{XnM^^E\Sn{Xn,) + CSn{Xn,) " (^n J)) 1)V2. 

Observe that both {\Xj — Xni\—E\Xj — Xni\—B{Xj — Xni)]j >1} and 
{\Xj — Xni\—E\Xj — Xni\+B{Xj — Xrn)]j > 1} form an associated sequence. Hence, 
using Lemma 2.8, we get, ■ 


sup 

xe[-Ci,Ci] 


From (4.5), we have. 


2n6n 

< — -h max 


J2-(\Xj-x^^\-E\Xj- 


x„. 


(4.8) 


P 


sup 
foehCi.Ci] 




> e 


Finally, 

OO 


< P 

< P 

sup 


2n5n e 
I bn ^21 
ACin ^ e 
2 - 


+ P 


+ B- 


max 

i 


- ^n,\-E\Xj - Xn^l) 


> 


2J 


< 


'8Cin\2 n 

" ' +H 


/80in y 
\br,dr,eJ 






(4.9) 


foe [-Cl,Cl] 




> e 


OO 

n=l 


brtdi 


\2 n’^^^dn} 

') ^ bie^ /■ 
(4.10) 

Result follows if bn = 0(n^+“/^“^), for some u> 1 and |(1-|- u) — (A; -|- l)p >1. □ 


4.2 Empirical joint distribution functions 

Let {Xn-,n > 1} be a sequence of stationary associated random variables. Bagai and 
Prakasa Rao (1991) had discussed the asymptotics for the empirical estimator of 
survival function for this sequence. Henriques and Oliveira (2003) had discussed the 
asymptotics for the histogram estimator for the two-dimensional distribution func¬ 
tion of (Xi, Xfc+i). In both the cases, the kernel is of bounded Hardy-Krause vari¬ 
ation. Similarly, the kernel of the histogram estimator for any finite k-dimensional 
distribution function is also of bounded Hardy-Krause variation. Assume further 
|X„|< C'i,n>l,0<C'i<oo. Then, for the two-dimensional distribution function 
of (Xi,Xfc+i), P(Xi < s,Xfc+i < t), s,t G M, the histogram estimator is, 

^ n—k 

Un{s, t) = -- Y] Yi,i+k ( 4 - 11 ) 

n — k 

i=\ 

where, Fi,i+fc = L(Xj < < t). Using Lemmas 2.4, 2.5 and 2.6, 

Cov{Y,^,+k, Yjj+k) < C{Cov{X,,XjY + Cov{X,+k,XjY + Cov{X,,Xj+kY+ 

Cou(Xi+fc,X,+fc)^), 
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for some 0 < <5 < 1/2. Using Lemma 2.1 and the condition Cov{Xi^Xj)^ < oo 
the asymptotic normality Unis,t) can be obtained. 


Remark 4.2. Henriques and Oliveira (2003) had obtained the asymptotic distribu¬ 
tion of the histogram estimator for stationary associated random variables belonging 
to L 2 under the covariance restriction 0 /^^^ Cov{Xi, Xj)^/^ < 00 . Replacing the 
covariance control by Demichev’s inequality (Lemma 2.5) leads to the condition on 
the covariances being less restrictive. 


5 Simulation Analysis 

The asymptotic normality of Gini’s mean difference based on stationary and asso¬ 
ciated observations are investigated via simulations. Let {Yj J > 1} be i.i.d from 
Exp{l/m) for some m € N. If Xj = min{Yj, ■ ■ ■ ,1^+m-i) for all 1 < J < n, then 
{Xj,l < j < n} forms a set of stationary associated random variables such that 
XjS are standard exponential variables {Exp{l)). Similarly, in order to obtain sta¬ 
tionary associated random variables {Xj, 1 < j < n} such that XjS are standard 
normal variables (A^(0,1)), we can set Xj = Yj + Yj^rn-i, where {Yj,j > 1} 
are i.i.d from A^(0,1/m) for some m G N. 

(1) We nse the statistical software R (http://www.r-project.org; R Development 
Core Team (2011)) for our simulations. The samples {Xj, 1 < j < n} are gener¬ 
ated as follows. 

(51) Xj = min{Yj,Yj^i), where {Yj,j > 1} are pseudo-random numbers from 
Exp{l/2) generated using rexp function in R. 

(52) Xj = min{Yj,Yj^i,Yj^ 2 ), where {l^-, j > 1} are pseudo-random numbers 
from Exp{l/3) generated nsing rexp function in R. 

(53) Xj = minfYj, • • •, l/’+g), where {Yj,j > 1} are psendo-random numbers from 
Exp{l/1Q) generated nsing rexp function in R. 

(54) Xj = Yj+Yj+i, where {Yj,j > 1} are pseudo-random numbers from W(0,1/2), 
generated using rnorm function in R. 

(55) Xj = Yj + Lj+i + ^+ 2 , where {Yj,j > 1} are pseudo-random numbers from 
A^(0,1/3), generated using rnorm function in R. 

(56) Xj = Yj + ■ ■ ■ -\- Yj^g, where {Yj,j > 1} are pseudo-random numbers from 
N{0, 1/10), generated nsing rnorm fnnction in R. 

(2) The resnlts are based on 10,000 replications and a = 0.05. 

(3) For our simulations, we use Lemma 2.7 for the estimation of au. We choose 

smallest integer less than or equal to r?!'^. 

(4) In Lemma 2.7, Yi = p\ (Xi) and 1/ = bXi, for some constant 6, 6 > 0 V i > 1, as /?i 
is lipshitz. For practical applications, the distribution function of the underlying 
population E will be nnknown. Hence, an estimator for pi{x) is needed. Let 

be analogous to Bn with Sj{k) replaced by Sj{k) = by 
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Yn = YH=ih{Xi)-, where/5i(x) = YJj=i Define, \Zi\= 2 \pi{Xi)-pi{Xi)\. 


Bn - Bn\ = 


. n—l 

—^—y 

n — I + 1^ 

j=i 


\Sj{i)-lYn 


n — £ + 


n—i 

lE 

i=i 


VI 


n—i j+l 


< 


{n-i+l)VIjZi i=j+i 




2=1 


< 


(n — ^ + 1)VI 

= 2^»pIE- 

J=1 


(n — tjl sup |/5i(a:) — /Oi(a:)| 

X 

\Xj — x\—pi{x) 


n 


l-s 


Putting s = ^, we get = 0(1) {i = in = In Theorem 4.2, putting 

A; = 10, p = 17/20, u = 11/10, and assuming Oou(Xi, Xj) = 0(n“^), 

we get, sup\Y^^=i 0 a.s as n —>■ oo. Hence, \Bn — Bn,\ —>• 0 a.s as 

X 

n ^ OO. 

In the following tables, 

a) g denotes the mean of the r = 10,000 sample Gini’s mean difference values, pi, 
1 <i <r; 

b) E.M.S.E (g) = YlViidi ~ 5 )^) where E.M.S.E denotes Estimated M.S.E; 

c) C.P (g) = f, where N = # {i : g, ^ {g - 2&n ^ ^>5 + 2^ x «)}. Here, 

Bn = ]:YVi=iBn{i), •20.025 = 1-95 9 9 64, and Bn{i), 1 < f < ?", denotes the 
estimated value for each sample; 

d) E.M.S.E {Bn) = ELi(^n(i) - 4)2; 

e) Median (g). Skewness (g), and Kurtosis (g) are the corresponding characteristics 
of the r sample statistic values. 


Table 5.1 Simulation Results for Exp{l) 


(SI) (m=2) 

o 

II 

a 

n=100 

n=200 

n=300 

n=500 

n=1000 

9 

0.9836181 

0.9930083 

0.9980009 

0.9983753 

0.9984739 

0.9989455 

E.M.S.E (g) 

0.03676916 

0.01931296 

0.009705885 

0.006434793 

0.003937328 

0.001930927 

C.P (g) 

0.8974 

0.9133 

0.9258 

0.9316 

0.9342 

0.9403 

Median (g) 

0.9711876 

0.9847915 

0.9941595 

0.9947342 

0.9978153 

0.9982964 

Skewness (g) 

0.3965101 

0.3466176 

0.2007335 

0.200636 

0.1203145 

0.08936168 

Kurtosis (g) 

3.256429 

3.219073 

3.026955 

3.002957 

3.001628 

3.020152 


(S2) (m=3) 

o 

II 

a 

n=100 

n=200 

n=300 

n=500 

n=1000 

9 

0.9732993 

0.9875498 

0.9941661 

0.9972799 

0.9972273 

0.9980701 

E.M.S.E (g) 

0.04972117 

0.02671703 

0.0133838 

0.008855159 

0.00531166 

0.002623486 

C.P (g) 

0.8804 

0.9002 

0.9201 

0.9241 

0.9296 

0.9396 

Median (g) 

0.9563219 

0.9793019 

0.9885436 

0.9928442 

0.9950708 

0.9966722 

Skewness (g) 

0.5218394 

0.3517759 

0.2756275 

0.2407178 

0.1604277 

0.1284387 

Kurtosis (g) 

3.513031 

3.167723 

3.106484 

3.15176 

3.057265 

3.060678 


(S3) (m=10) 

o 

II 

c 

n=100 

n=200 

n=300 

n=500 

n=1000 

9 

0.8758573 

0.93135 

0.9627473 

0.9779133 

0.9869055 

0.9940865 

E.M.S.E (g) 

0.1280773 

0.07158817 

0.03887426 

0.02633781 

0.0164874 

0.008295358 

C.P (g) 

0.7579 

0.8134 

0.8478 

0.8723 

0.8889 

0.9097 

Median (g) 

0.8260538 

0.9033452 

0.949436 

0.9671378 

0.9795772 

0.9908148 

Skewness (g) 

0.9151533 

0.6293821 

0.4272413 

0.3993889 

0.3150814 

0.2282979 

Kurtosis (g) 

4.462582 

3.537275 

3.239824 

3.336504 

3.231984 

3.017788 
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Table 5.2 Simulation Results for N{0^ 1) 


(54) (m = 2) 

n=50 

n=100 

n=200 

n=300 

n=500 

n=1000 

9 

1.114259 

1.120531 

1.125395 

1.126033 

1.127356 

1.127414 

E.M.S.E (g) 

0.01965189 

0.00962333 

0.004895764 

0.003354603 

0.001963522 

0.0009780631 

C.P (g) 

0.9077 

0.9262 

0.9317 

0.9327 

0.9413 

0.9441 

Median (g) 

1.111408 

1.119302 

1.124416 

1.124914 

1.127125 

1.127346 

Skewness (g) 

0.1287656 

0.1264063 

0.09834354 

0.06234944 

0.07745284 

0.04384436 

Kurtosis (g) 

2.993197 

2.923884 

2.967621 

2.907618 

2.947061 

2.880582 


(S5) (m =3) 

n=50 

n=100 

n=200 

n=300 

n=500 

n=1000 

9 

1.102877 

1.114272 

1.122001 

1.122333 

1.125143 

1.126909 

E.M.S.E (g) 

0.02673821 

0.01331181 

0.006773474 

0.004577988 

0.002688516 

0.001317823 

C.P (g) 

0.8923 

0.9173 

0.9259 

0.9317 

0.9425 

0.9483 

Median (g) 

1.096844 

1.11083 

1.119453 

1.121584 

1.124423 

1.126417 

Skewness (g) 

0.2345972 

0.1703191 

0.1451524 

0.08957866 

0.05842091 

0.04335457 

Kurtosis (g) 

3.027241 

3.073061 

2.992162 

3.030902 

2.968134 

2.987061 


(S6) (m=10) 

n=50 

n=100 

n=200 

n=300 

n=500 

n=1000 

9 

1.004597 

1.064742 

1.096209 

1.10654 

1.113758 

1.121415 

E.M.S.E (g) 

0.06772895 

0.03811105 

0.02010571 

0.01393415 

0.008248388 

0.004292068 

C.P (g) 

0.7678 

0.8503 

0.8878 

0.8992 

0.9186 

0.9277 

Median(g) 

0.9787768 

1.051586 

1.088681 

1.101294 

1.112035 

1.119691 

Skewness (g) 

0.6010408 

0.4048426 

0.2736039 

0.2290044 

0.2082488 

0.1407511 

Kurtosis(g) 

3.470072 

3.252502 

3.034935 

3.044965 

3.075823 

3.000726 


Observations 

(i) Estimation of ajj- As discussed earlier, we use an estimator for ajj for simula¬ 
tions. (4) and Lemma 2.7 imply that \fnj2Bn is also a consistent estimator for 
ay. For the sample generated from Exp{\), using (51), (52), and (53), we an¬ 
alyze the performance of the estimator by comparing 2^t: j'lBn with the actual 
values (2 c 7;7). The following table shows that as the sample size increases, the 
value of bias reduces. As expected, E.M.S.E (Estimated M.S.E) also reduces 
with the increase in the sample size. For m = 2,3, the rate of convergence is 
faster than for m = 10. 


Table 5.3 Performance of Bn for Exp(l) 


(SI) (m=2), 2au = 1.393864 

n=50 

n=100 

n=200 

n=300 

n=500 

n=1000 

2^/E/2Br, 

1.113067 

1.206764 

1.26211 

1.285339 

1.312719 

1.335088 

Bias = 2| yJ'K 1‘lBn — o-i/\ 

0.280797 

0.187100 

0.131754 

0.108525 

0.081145 

0.058776 

E.M.S.E (2^AN2B„) 

0.1941304 

0.1271084 

0.0828345 

0.0641774 

0.0455669 

0.029956 

(S2) (m=3), 2au = 1.639871 

n=50 

n=100 

n=200 

n=300 

n=500 

n=1000 

2y/7j2Bn 

1.217808 

1.345743 

1.434806 

1.470134 

1.506485 

1.544193 

Bias = 2| ^/nj2B„ — crij\ 

0.422063 

0.294128 

0.205065 

0.169737 

0.133386 

0.095678 

E.M.S.E {2^AN2B„) 

0.2784683 

0.1971176 

0.1303082 

0.1002306 

1.508937 

0.046944 

(S3) (m=10), 2cru = 2.897561 

n=50 

n=100 

n=200 

n=300 

n=500 

n=1000 

2^Nj2Br, 

1.435125 

1.761425 

2.011354 

2.149753 

2.29772 

2.469504 

Bias = 2| yJ'K 1‘lBn — (T[/| 

1.462436 

1.136136 

0.886207 

0.747808 

0.599841 

0.428057 

E.M.S.E (2^AN2B„) 

0.7394865 

0.6412837 

0.4948429 

0.4134175 

0.3136523 

0.205857 


(ii) Asymptotic Normality: From Tables 5.1 and 5.2, we observe that for a fixed 
m as the sample size increases, the approximation to the normal distribution is 
better. For m = 2,3, the convergence to normality is faster, as expected, as the 
variables are “almost independent”. For m = 10, we see that the approximation 
is good only for much larger values of n. The use of the estimator of ay could 
also affect the convergence as the bias and E.M.S.E (Estimated M.S.E) reduces 
much faster for m = 2,3 than for m = 10. 
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(iii) Estimation of the mean difference: When X^s are Exp{l), the value of the 
mean difference, 9, is 1. From Table 5.1, it can be seen that when m = 2,3, the 
convergence of the mean of 10,000 sample Gini’s mean difference values to 1 is 
faster than when m = 10. This is expected as greater dependence leads to a 
slower rate. Similar results are observed from Table 5.2. Here 6 = 1.128379. 

(iv) Comparison with i.i.d setup: A comparison of the simulation results with the 
results of Greselin and Zenga (2006) who had performed the simulations for 
the statistic under the i.i.d setup, indicate that relatively larger sample sizes 
are needed for applying the asymptotic results under the dependent setup than 
under the i.i.d setup. 

6 Conclusions 

In this paper, we give the limiting distribution of U-statistics based on kernels of 
bounded Hardy-Krause variation when the underlying sample consists of stationary 
associated observations. As an application, we obtain the asymptotic distribution 
of Gini’s mean difference under the dependent setup. Simulation results performed 
for the statistic indicate that reasonable sample sizes are needed for using the 
normality approximation. Greater the dependence, larger the sample sizes needed 
for a viable use of the asymptotic normality results. 

Results for kernels which are differentiable have been discussed in Garg and 
Dewan (2015). Results for discontinuous kernels that are not component-wise 
monotonic and are not functions of bounded Hardy-Krause variations are under 
preparation. 


7 Acknowledgment 

We would like to thank the two anonymous referees for their critical comments that 
helped in improving the earlier version of the paper. 


References 

Adams C.R. and Clarkson J.A. (1934). Properties of functions f(x, y) of bounded variation. 
Trans. Amer. Math. Soc. , 36(4): 711-730 

Bagai I. and Prakasa Rao B.L.S. (1991). Estimation of the survival function for stationary 
associated processes. Statist. Probab. Lett., 12(5):385--391. 

Beare B.K. (2009). A generalization of hoeffding’s lemma, and a new class of covariance 
inequalities. Statist. Probab. Lett., 79(5):637-642. 

Birkel T. (1988a). Moment bounds for associated sequences. Ann. Probab., 16(3):1184- 

1193. 

Bulinski A. and Shashkin A. (2007). Limit theorems for associated random fields and 
related systems. Advanced Series on Statistical Science and Applied Probability, 10. 
World Scientific Publishing, New Jersey. 


15 


Bulinski A. and Shashkin A. (2009). Limit Theorems for Associated Random Variables. 
Brill Academic Pub. 

Christofides T.C. (2004). U-statistics on associated random variables. J. Statist. Plann. 
Inference, 119(1):1-15. 

Clarkson J.A. and Adams C.R. (1933). On definitions of bounded variation for functions 
of two variables. Trans.Amer. Math. Soc., 35(4):824-854. 

Demichev, V.P. (2014). An Optimal Estimate for the Covariance of Indicator Functions of 
Associated Random Variables Theory Probab. AppL, 58(4):675-683. 

Esary J.D., Proschan F., and Walkup D.W. (1967). Association of random variables, with 
applications. Ann. Math. Statist., 38(5):1466-1474. 

Garg M. and Dewan I. (2015). On limiting distribution of U-statistics based on associated 
random variables (submitted). 

Greselin F. and Zenga M. (2006). Convergence of the sample mean difference to the normal 
distribution: simulation results. Statistica & Applicazioni, 4(special issue n° 1):97~120. 

Henriques C. and Oliveira P.E. (2003). Estimation of a two-dimensional distribution func¬ 
tion under association. J. Statist. Plann. Inference, 113(1):137- 150. 

Hoeffding W. (1948). A class of statistics with asymptotically normal distribution. Ann. 
Math. Statistics, 19(3):293-325. 

Lebowitz J.L. (1972). Bounds on the correlations and analyticity properties of ferromagnetic 
ising spin systems. Comm. Math. Phys., 28(4):313--321. 

Lee A.J. (1990). U-statistics: Theory and practice. Marcel Dekker Inc., New York. 

Newman C.M. (1980). Normal fluctuations and the FKG inequalities. Comm. Math. Phys., 
74(2):119-128. 

Newman C.M. (1984). Asymptotic independence and limit theorems for positively and neg¬ 
atively dependent random variables. In Inequalities in statistics and probability (Lincoln, 
Neb., 1982), volume 5 of IMS Lecture Notes Monogr. Ser., pages 127-140. Inst. Math. 
Statist., Hayward, CA. 

Oliveira P. (2012). Asymptotics for Associated Random Variables. Springer, Heidelberg. 

Owen, A. (2004). Multidimensional variation for quasi-Monte Carlo. Technical Report no. 
2004-02. Department of Statistics, Standford University. 

Prakasa Rao B.L.S. (2012). Associated sequences, demimartingales and nonparametric 
inference. Probability and its Applications (New York). Birkhauser/Springer Basel AG, 
Basel. 

R Development Core Team (2011). R: A Language and Environment for Statistical Com¬ 
puting. R Foundation for Statistical Computing, Vienna, Austria. ISBN 3-900051-07-0. 


16 


